Clustering speakers by their voices
Abstract
This paper addresses the problem of clustering speakers by their voices. With the rapid growth of available speech data, from television broadcasts to voice mail, automatic systems for archive retrieval, organization, and labeling by speaker are necessary. Clustering conversations by speaker is a solution to all three of these tasks. Another application of speaker clustering is grouping utterances together for speaker adaptation in speech recognition. Metrics based on the purity and completeness of clusters are introduced. Next, our approach to speaker clustering is described, and finally experimental results on a subset of the Switchboard corpus are presented.
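The abstract names purity and completeness as the basis of its metrics but does not define them here. As a rough illustration only, the Python sketch below uses one common majority-count formulation, which is an assumption and not necessarily the definition used in the paper: purity is the fraction of utterances that belong to the dominant speaker of their cluster, and completeness is the fraction of utterances that fall in the dominant cluster of their speaker. The function name `purity_and_completeness` and the toy labels are hypothetical.

```python
from collections import Counter, defaultdict


def purity_and_completeness(cluster_ids, speaker_ids):
    """Return (purity, completeness) for one clustering of utterances.

    Assumed definitions (not taken from the paper):
      purity       = share of utterances matching their cluster's majority speaker
      completeness = share of utterances lying in their speaker's majority cluster
    """
    if len(cluster_ids) != len(speaker_ids):
        raise ValueError("cluster_ids and speaker_ids must have the same length")
    if not cluster_ids:
        raise ValueError("at least one utterance is required")

    speakers_in_cluster = defaultdict(Counter)  # cluster -> speaker counts
    clusters_of_speaker = defaultdict(Counter)  # speaker -> cluster counts
    for cluster, speaker in zip(cluster_ids, speaker_ids):
        speakers_in_cluster[cluster][speaker] += 1
        clusters_of_speaker[speaker][cluster] += 1

    total = len(cluster_ids)
    purity = sum(max(c.values()) for c in speakers_in_cluster.values()) / total
    completeness = sum(max(c.values()) for c in clusters_of_speaker.values()) / total
    return purity, completeness


# Toy example: six utterances, two clusters, three speakers.
clusters = [0, 0, 0, 1, 1, 1]
speakers = ["A", "A", "B", "B", "B", "C"]
purity, completeness = purity_and_completeness(clusters, speakers)
```

Under these assumed definitions the two scores pull in opposite directions: putting every utterance in its own cluster maximizes purity, while putting everything in a single cluster maximizes completeness, so the two are typically reported together.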
Similar resources
Prosodic and Spectral iVectors for Expressive Speech Synthesis
This work presents a study on the suitability of prosodic and acoustic features, with a special focus on i-vectors, in expressive speech analysis and synthesis. For each utterance of two different databases, a laboratory recorded emotional acted speech, and an audiobook, several prosodic and acoustic features are extracted. Among them, i-vectors are built not only on the MFCC base, but also on ...
Perceptual scaling of voice identity: common dimensions for different vowels and speakers.
The aims of our study were (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space suc...
Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news
The average pitch of 68 news broadcasters (34 female / 34 male speakers) was evaluated by 6 expert listeners. Additionally, the average fundamental frequency for all samples was analyzed by means of a series of standard pitch detection algorithms. The results show a strong correlation of acoustic mean and auditory median values for male voices, whereas the auditory mean values for female voices are...
Building personalised synthetic voices for individuals with severe speech impairment
For individuals with severe speech impairment accurate spoken communication can be difficult and require considerable effort. Some may choose to use a voice output communication aid (or VOCA) to support their spoken communication needs. A VOCA typically takes input from the user through a keyboard or switch-based interface and produces spoken output using either synthesised or recorded speech. ...
Publication date: 1998